What's in a p-value in NLP?

نویسندگان

  • Anders Søgaard
  • Anders Johannsen
  • Barbara Plank
  • Dirk Hovy
  • Héctor Martínez Alonso
چکیده

In NLP, we need to document that our proposed methods perform significantly better with respect to standard metrics than previous approaches, typically by reporting p-values obtained by rankor randomization-based tests. We show that significance results following current research standards are unreliable and, in addition, very sensitive to sample size, covariates such as sentence length, as well as to the existence of multiple metrics. We estimate that under the assumption of perfect metrics and unbiased data, we need a significance cut-off at ⇠0.0025 to reduce the risk of false positive results to <5%. Since in practice we often have considerable selection bias and poor metrics, this, however, will not do alone.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Effects of Linear and Nonlinear Periodized Resistance Training on some of Kidney Functional Parameters in Women with Abdominal Obesity

Introduction: The aim of this study was to determine the effects of linear and nonlinear periodized resistance training on serum levels of creatinine, glomerular filtration rate (GFR), and creatinine clearance in women with abdominal obesity.    Materials & Methods: This study was conducted on 42 women who were randomly assigned into linear periodized (LP) resistance training (N=15, age: 40.4...

متن کامل

اثربخشی آموزش گروهی برنامه‌ریزی عصب زبان‌شناختی بر میزان امید و کیفیت زندگی کودکان سرطانی

Objectives This study aimed to examine the effect of Neuro-Linguistic Programming (NLP) on the hope and quality of life in children with cancer.  Methods The study design is quasi-experimental study with pretest, posttest, follow-up and control group. Study population consisted of children (male and female) with cancer at AminrKabir Hospital and Tabassom Cancer Support Community in 2016 who ap...

متن کامل

A New Method for Improving Computational Cost of Open Information Extraction Systems Using Log-Linear Model

Information extraction (IE) is a process of automatically providing a structured representation from an unstructured or semi-structured text. It is a long-standing challenge in natural language processing (NLP) which has been intensified by the increased volume of information and heterogeneity, and non-structured form of it. One of the core information extraction tasks is relation extraction wh...

متن کامل

Comparing the Effectiveness of Training Cognitive Behavioral Therapy and Neuro-linguistic Programming Strategies on Enhancing Resilience of High School Students in Kerman, Iran

Background The aim of the present research was to compare the effectiveness of training cognitive behavioral therapy and Neuro-linguistic programming (NLP) strategies on mitigating anxiety, depression, and stress of students. Materials and Methods: The method of this semi-experimental research was pretest posttest with control grou...

متن کامل

Automated chart review utilizing natural language processing algorithm for asthma predictive index

BACKGROUND Thus far, no algorithms have been developed to automatically extract patients who meet Asthma Predictive Index (API) criteria from the Electronic health records (EHR) yet. Our objective is to develop and validate a natural language processing (NLP) algorithm to identify patients that meet API criteria. METHODS This is a cross-sectional study nested in a birth cohort study in Olmste...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014